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1 Introduction 

1.1 Overview and main results 

In the paper |19| . written in collaboration with Gesine Reinert, we proved several univer- 
sality results, involving sequences of random vectors whose components have the form of 
finite homogeneous sums based on sequences of independent random variables. Roughly 
speaking, our main finding implied that, in order to study the normal approximations of 
homogeneous sums (and under suitable moment conditions) it is always possible to replace 
the original sequence with an i.i.d. Gaussian family. The power of this approach resides in 
the fact that homogeneous sums associated with Gaussian sequences are indeed elements of 
the so-called Wiener chaos, so that normal approximations can be established by means of 
the general techniques developed in [HI [22l [23] - that are based on a powerful interaction 
between standard Gaussian analysis, Malliavin calculus (see e.g. [21]) and Stein's method 
(see e.g. [6]). Moreover, in the process one always recovers uniform bounds over suitable 
classes of smooth functions. 

The aim of this paper is to introduce these techniques into the realm of random matrix 
theory. More specifically, our goal is to use the universality principles developed in [H], in 
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order to prove the forthcoming Theorem 11.11 which consists in a multidimensional central 
limit theorem (CLT) for traces of non-Hermitian random matrices with i.i.d. real-valued 
entries. More precisely, let X be a centered real random variable, having unit variance and 
with finite moments of all orders, that is, E{X) = 0, = 1 and < oo for every 

n ^ 3. We consider a doubly indexed collection X = {Xij : i,j ^ 1} of i.i.d. copies of X. 
For every integer X ^ 2, we denote by X]y the N x N random matrix 

Xr,= l^:z,j = l,...,N], (1.1) 



and by Tr(-) and X^, respectively, the usual trace operator and the kth power of X^ 
Theorem 1.1 Let the above notation prevail. Fix m ^ 1, as well as integers 
1 ^ ki < . . . < km- 



Then, the following holds, 
(i) As N oo, 



(Tr(X^i) - E [Tr(X^^)] , . . . , Tr(X^-) - E [Tr(X 



Law 



{Zki,---, Zk^), (1.2) 



where Z = {Z^ : k ^ 1} denotes a collection of real independent centered Gaussian 
random variables such that, for every k ^ 1, E{Zl) = k. 

(ii) Write (3 = E\X\'^. Suppose that the function ip : ^ M is thrice differentiable and 
that its partial derivatives up to the order three are bounded by some constant B < oo. 
Then, there exists a finite constant C = C{j3, B,m, ki, km) , not depending on N, 
such that 



E 



Tr(X^^) - ^[Tr(X^^)] Tr(X^'-) - E[Tr(X^-)]. 



Var(Tr(X^^)) 



-E 



Var(Tr(X^'")) 

Zk^ Zk„ 



^1 



kr, 



(1.3) 

^ CN-^'\ 



Remark 1.2 1. We chose to state and prove Theorem ll.ll in the case of non-Hermitian 
matrices with real-valued entries, mainly in order to facilitate the connection with 
the universality results proved in [I9]. However, our techniques may be extended to 
the case where the random variable X is complex-valued and with finite absolute 
moments of every order. This line of research will be pursued elsewhere. One should 
also note that, differently from [25], in the present paper we do not use any technique 
coming from complex analysis. 
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2. Fix an integer K ^ 2 and assume that E\X\ < oo, while higher moments are 
allowed to be possibly infinite. By inspection of the forthcoming proof of Theorem 
11.11 one sees that the CLT (11. 2p as well as the bound (11. 3p continue to hold, as long 
as the integers ki, ...,km verify kj ^ K for j = 1, m. 

3. In a similar vein as at the previous point, by imposing adequate uniform bounds on 
moments one can easily adapt our techniques in order to deal with random matrices 
whose entries are independent but not identically distributed. One crucial fact sup- 
porting this claim is that the universality principles of Section [2] hold for collections 
of independent, and not necessarily identically distributed, random variables. 

4. For non-Hermitian matrices, limits of moments are not sufficient to provide an ex- 
haustive description of the limiting spectral measure or of the fiuctuations around it. 
Rather, one would need to consider polynomials in the eigenvalues and their complex 
conjugates. These quantities cannot be represented using traces of powers of X^, so 
that our approach cannot be extended to this case. 

1.2 Discussion 

In this section we compare our Theorem II . II with some related results proved in the existing 
probabilistic literature. 

1. In the paper [25j, Rider and Silverstein proved the following CLT. 

Theorem 1.3 Let X be a complex random variable such that E{X) = E{X^) = 0, 
E{\X\^) = 1, E{\X\'') ^ k ^ 3 (for some a > 0) and Re{X), lm{X) possess a 

joint bounded density. For N ^ 2, let X^ be defined as in Consider the space Ti of 

functions / : C ^ C which are analytic in a neighborhood of the disk \z\ ^ 4 and otherwise 
bounded. Then, as N oo, the random field 



converges in the sense of finite- dimensional distributions (f.d.d.) to the centered complex- 
valued Gaussian field {Z{f) : f G 7i}, whose covariance structure is given by 



Here, i] = {z ^ £. : \z\ ^1] is the unit disk, and d^z/n stands for the uniform measure on 
V (in other words, d^z = dxdy for x, y G M such that z = x + iy). 

By using the elementary relations: for every integers n, m ^ 0, 



{Tr(/(X;v))-i^[Tr(/(X^))] : f e H} 




(1.4) 
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one sees that our Theorem 11.11 can be reformulated by saying that 

{Tr(/(X;v)) - E [Tr(/(X^))] : / G Pol(C)} {Z{f) : / G Pol(C)}, (1.5) 

where the covariance structure of {Z{f) : / G Pol(C)} is given by I\1A\\ . It follows that 
Theorem 11.11 roughly agrees with Theorem 11.31 however, we stress that the framework of 
[25] is different from ours, since the findings therein cannot be applied to the real case due 
to the assumption that real and imaginary parts of entries must possess a joint bounded 
density. In addition, also note that (differently from [25]) we do not introduce in the 
present paper any requirement on the absolute continuity of the law of the real random 
variable X, so that the framework of our Theorem II . II contemplates every discrete random 
variable with values in a finite set and with unit variance. 

2. One should of course compare the results of this paper with the CLTs involving 
traces of Hermitian random matrices, like for instance Wigner random matrices. One 
general reference in this direction is the fundamental paper by Anderson and Zeitouni 
[3], where the authors obtain CLTs for traces associated with large classes of (symmetric) 
band matrix ensembles, using a version of the classical method of moments based on graph 
enumerations. It is plausible that some of the findings of the present paper could be also 
deduced from a suitable extension of the combinatorial devices introduced in [^ to the case 
of non-Hermitian matrices. However, proving Theorem 11.11 using this kind of techniques 
would require estimates for arbitrary joint moments of traces, whereas our approach merely 
requires the computation of variances and fourth moments. Also, the findings of [3] do not 
allow to directly deduce bounds such as (II. 3p . We refer the reader e.g. to Guionnet [O] 
or to Anderson et al. [2j, and the references therein, for a detailed overview of existing 
asymptotic results for large Hermitian random matrices. 

3. The general statement proved by Chatterjee in [HI Theorem 3.1] concerns the nor- 
mal approximation of linear statistics of random matrices that are possibly non-Hermitian. 
However, the techniques used by the author require that the entries can be re-written as 
smooth transformations of Gaussian random variables. In particular, the findings of [5] do 
not apply to discrete distributions. On the other hand, the results of [5] also provide uni- 
form bounds (based on Poincare-type inequalities and in the total variation distance) for 
one-dimensional CLTs. Here, we do not introduce any requirements on the absolute con- 
tinuity of the law of the real random variable X, and we get bounds for muiti-dimensional 
CLTs. 

4. Let us denote by {\j{N) : j = 1,...,N} the complex-valued (random) eigenvalues 
of Xtv, repeated according to their multiplicities. Theorem 11.11 deals with the spectral 
moments of Xat, that are defined by the relations: 
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where denote the spectral measure of X^r. Recall that 



1 ^ 



(1.7) 



where 5z{-) denotes the Dirac mass at z, and observe that one has also the alternate 
expression 



N 



Y 



(1.8) 



«l,...,ifc=l 



It follows that our Theorem 11.11 can be seen as a partial (see Remark 11.21 (4) above) 
characterization of the Gaussian fluctuations associated with the so-called circular law, 
whose most general version has been recently proved by Tao and Vu: 

Theorem 1.4 (Circular law, see [29j) LetX be a complex-valued random variable, with 
mean zero and unit variance. For N ^ 2, let be defined as in Then, as N oo, 

the spectral measure yUxjv converges almost surely to the uniform measure on the unit disk 
\] = {z G C : \z\ ^ 1} . The convergence takes place in the sense of the vague topology. 

To see why Theorem 1 1 . II concerns fluctuations around the circular law, one can proceed 
as follows. First observe that, since E{X^) = 1 and E^X"^) < cxo by assumption, one can 
use a result by Bai and Yin [H Theorem 2.2] stating that, with probability one, 



limsup max |Aj(A^)| ^ 1. 



(1.9) 



Now fix a polynomial p{z). Elementary considerations yield that, since f ll.91) and the 
circular law are in order, with probability one 



l-TripiX^)) ^ - [ piz)d'z = piO). 



(1.10) 



On the other hand, it is not difficult to see that, for every k ^ 1 and as iV — > oo, 

1 



E 



z dfixj^iz) 



E 



N 



Tr(X 







(one can use e.g. the same arguments exploited in the second part of the proof Proposition 
13.11 below). This implies in particular, for every complex polynomial p, 



E 



^Tr(p(X„)) 



p(0) = - / p{z)d^z. 

Jv 



(1.11) 



By (11.101) and (II. lip , one has therefore that the quantities j^Tt(p{Xn)) and E{jjTr{p{Xiy))) 
both converge to p(0), and (II. 5p ensures that, for sufficiently large, the difference 



Tr(p(X^)) - iVp(O) - [E (Tr(p(X^))) - Np{0)] 
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has approximately a centered Gaussian distribution with variance ^ \p'{z)\'^d'^z. Equiva- 
lently, one can say that the random variable j^Tt{p{X^)) tends to concentrate around its 
mean as goes to infinity, and Hl.bfi describes the Gaussian fluctuations associated with 
this phenomenon. 

On the other hand, one crucial feature of the proof of the circular law provided in 
[29] is that it is based on a universality principle. This result basically states that, under 
adequate conditions, the distance between the spectral measures of (possibly perturbed) 
non-Hermitian matrices converges systematically to zero, so that Theorem 11.41 can be es- 
tablished by simply focussing on the case where X is complex Gaussian (this is the so-called 
Ginibre matrix ensemble, first introduced in |i3j). It is interesting to note that our proof of 
Theorem ll.ll is also based on a universality result. Indeed, we shall show that the relevant 
part of the vector on the LHS of (11.21) (that is, the part not vanishing at infinity) has the 
form of a collection of homogeneous sums with fixed orders. This implies that the CLT 
in p.2p can be deduced from the results established in [I9], where it is proved that the 
Gaussian Wiener chaos has a universal character with respect to Gaussian approximations. 
Roughly speaking, this means that, in order to prove a CLT for a vector of general ho- 
mogeneous sums, it is sufficient to consider the case where the summands are built from 
an i.i.d. Gaussian sequence. This phenomenon can be seen as a further instance of the 
so-called Lindeberg invariance principle for probabilistic approximations, and stems from 
powerful approximation results by Rotar' |27j and Mossel et al. [17J. See the forthcoming 
Section [2] for precise statements. 

5. We finish this section by listing and discussing very briefly some other results related 
to Theorem 11.11 taken from the existing probabilistic literature. 

- In Rider [2lj (but see also Forrester [10]), one can find a CLT for (possibly discon- 
tinuous) linear statistics of the eigenvalues associated with complex random matrices 
in the Ginibre ensemble. This partially builds on previous findings by Costin and 
Lebowitz [7]. 

- Reference |26], by Rider and Virag, provides further insights into limit theorems 
involving sequences in the complex Ginibre ensemble. In particular, one sees that re- 
laxing the assumption of analyticity on test functions yields a striking decomposition 
of the variance of the limiting noise, into the sum of a "bulk" and of a "boundary" 
term. Another finding in |26| is an asymptotic characterization of characteristic 
polynomials, in terms of the so-called Gaussian free Geld. 

- Finally, one should note that the Gaussian sequence Z in Theorem 11.11 also appears 
when dealing with Gaussian fiuctutations of vectors of traces associated with large, 
Haar-distributed unitary random matrices. See e.g. [8| and [9] for two classic refer- 
ences on the subject. 
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1.3 Proof of Theorem HJJ: the strategy 

In order to prove (11 .2^ (and (11.31) as well), we use an original combination of techniques, 
which are based both on the universality results of [19J and on combinatorial considerations. 
The aim of this section is to provide a brief outline of this strategy. 

For ^ 1, write [A^] = {1, A^}. For ^ 2, let us denote by dJJ^ the collection of 
all vectors i = {ii, . . . G [A^]'^ such that all pairs (?„, ia+i), a = 1, . . . ,k, are different 
(with the convention that ik+i = ii), that is, i G D^^^ if and only if {ia,ia+i) 7^ {%,'ib+i) 
for every a ^ h. Now consider the representation given in (II. 8p and, after subtracting the 
expectation, rewrite the resulting expression as follows: 

Tr(X^) - E [Tr(X^)] 

N 

il,...,ifc=l 

A^ 2 ^ ^ Xi-^i^Xi^i^ • • • Xi^i-^ 

+A^ 2 ^ [Xi^i^Xi^i^ ■ ■ ■ Xi^i^ - E[Xi^i^Xi^i^ ■ ■ ■ Xi^i^]). (1.13) 

Our proof of (|1.2p is based on the representation (|1.12l) - p.l3l) . and it is divided in two 
(almost independent) parts. 



I. In Section [3l we shall prove that the following multi-dimensional CLT takes place for 
every integers 2 ^ ki < ... < km'. 



N 



i=l 



(fcl) 



■ ■ ■ 1 -f* / J ^«l«2^22«3 



\ 



ieD 



(fern) 



Law 



(1.14) 



I 



for Z = {Zj : i ^ 1} as in Theorem II. 1[ In order to prove 01. 14^ . we apply the univer- 
sality result obtained in [H] (and stated in a convenient form in the subsequent Section 
[2|). This result roughly states that, in order to show p.l4p in full generality, it is sufficient 
to consider the special case where the collection X = {Xij : i,j ^ 1} is replaced by an 
i.i.d. centered Gaussian family G = {Gij : i,j ^ 1}, whose elements have unit variance. 
In this way, the components of the vector on the LHS of (I1.14p become elements of the 
so-called Gaussian Wiener chaos associated with G: it follows that one can establish the 
required CLT by using the general criteria for normal approximations on a fixed Wiener 
chaos, recently proved in [HI [22], [23] . Note that the results of [THl[22l[23j can be described 
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as a "simplified method of moments": in particular, the proof of (11.141) will require the mere 
computation of quantities having the same level of complexity of covariances and fourth 
moments. 

II. In Section [H we shall prove that the term (11.131) vanishes as oo, that is, for 

every ^ 2, 

N 2 (Xj^jjXjjjg ■ ■ ■ — ElXi-^i^Xi^i:^ ■ ■ ■ Xj^jj) -^0 in L'^{Q). 

(1.15) 

The proof of (11.151) requires some subtle combinatorial analysis, that we will illustrate 
by means of graphical devices, known as diagrams. Some of the combinatorial arguments 
and ideas developed in Section [H should be compared with the two works by Geman [m[T2] . 

Then, the upper bound ( ll.3p will be deduced in Section [441 from the estimates obtained 
at the previous steps. 

The rest of the paper is organized as follows. In Section [2] we present the universality 
results proved in [19], in a form which is convenient for our analysis. Section [3] contains a 
proof of ( I1.14p . whereas Section [4] deals with p. 151) . 

2 Main tool: universality of Wiener chaos 

In what follows, every random object is defined on an adequate common probability space 
(f2, P). The symbols E and 'Var' denote, respectively, the expectation and the variance 
associated with P. Also, given a finite set B, we write \B\ to indicate the cardinality of B. 
Finally, given numerical sequences a^^bN, N ^ 1, we write ^ whenever /h^ 1 
as ^ CX3. 

We shall now present a series of invariance principles and central limit theorems involv- 
ing sequences of homogeneous sums. These are mainly taken from [H] (Theorem 12. 2p . [23] 
(Theorem 12. 4p and |22j (Theorem 12. 6p . Note that the framework of [19] is that of random 
variables indexed by the set of positive integers. Since in this paper we mainly deal with 
random variables indexed by pairs of integers (i.e., matrix entries) we need to restate some 
of the findings of [IHj in terms of random variables indexed by a general (fixed) discrete 
countable set A. 

Definition 2.1 (Homogeneous sums) Fix an integer k ^ 2. Let Y = {Ya : a E A} 

be a collection of square integrable and centered independent random variables, and let 
f : A^ M he a symmetric function vanishing on diagonals (that is, /(ai,...,afc) = 
whenever there exists k ^ j such that = aj), and assume that / has finite support. The 
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random variable 

Q,(/,Y) = Yl /(ai,...,afe)F,,---y,, = fc!/(ai, a,)F,, ■ ■ ■ F,, 

ai,...,ak&A {ai,...,afe}cA*= 

(2.16) 

is called the homogeneous sum, of order k, based on / and Y. Clearly, E[Qk{f,Y)] = 
and also, if E(Y^) = 1 for every a E A, then 

Emf,Yr] = k\\\f\\i (2.17) 

where, here and for the rest of the paper, we set 

11/11^ = E 

ai,...,af;^A 

Now let G = {Ga : a G A} be a collection of i.i.d. centered Gaussian random variables 
with unit variance. We recall that, for every k and every /, the random variable Qk{f, G) 
(defined according to (12 .16^ ) is an element of the fcth Wiener chaos associated with G. See 
e.g. Janson [15] for basic definitions and results on the Gaussian Wiener chaos. The next 
result, proved in [Ej, shows that sequences of random variables of the type Qk{f, G) have 
a universal character with respect to normal approximations. The proof of Theorem 12.21 
is based on a powerful interaction between three techniques, namely: the Stein's method 
for probabilistic approximations (see e.g. [6|), the Malliavin calculus of variations (see e.g. 
[2T]). and a general Lindeberg-type invariance principle recently proved by Mossel et al. 
in [17]. 

Theorem 2.2 (Universality of Wiener chaos, see |19] ) Let G = {Ga : a E A} be a 

collection of standard centered i.i.d. Gaussian random variables, and fix integers m ^ 1 and 
ki, km ^ 2. For every j = 1, m, let {/^■* : N 1} be a sequence of functions such that 
/^^ : A^^ —>■ R is symmetric and vanishes on diagonals. We also suppose that, for every 
j = l,...,m, the support of fl^\ denoted by supp(/^^), is such that |supp(/^'')| oo, 
as oo. Define <5fcj(/w'\G), N ^ 1, according to ^2.1 61) . Assume that, for every 

j = I, ...m, the following sequence of variances is bounded: 

E[Qk^{f^^\Gn AT^l. (2.18) 

Let V be a mxm non-negative symmetric matrix, and let V) indicate a m- dimensional 

centered Gaussian vector with covariance matrix V. Then, as N ^ oo, the following two 
conditions are equivalent. 

(1) The vector {Qkj if '-3 = l,---,fn} converges in law to cyKn{0,V). 

(2) For every sequence X = {Xa : a G A} of independent centered random variables, with 
unit variance and such that snp^ElXaf < oo, the law of the vector {Qfe^ (/]^\ X) : 
j = l,...,m} converges to the law o/o/l^(0, V^). 
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Note that Theorem 12 . 21 concerns only homogeneous sums of order k ^ 2: it is easily seen 
(see e.g. [IS, Section 1.6.1]) that the statement is indeed false in the case k = 1. However, 
if one considers sums with a specific structure (basically, verifying some Lindeberg-type 
condition) one can embed sums of order one into the previous statement. A particular 
instance of this fact is made clear in the following statement, whose proof (combining the 
results of [I9] with the main estimates of p^) is standard and therefore omitted. 

Proposition 2.3 For m ^ 1, let the kernels {/^^ : N 1}, j = l,...,m, verify the 
assumptions of Theorem \2.Sl Let {ai : i ^ 1} be an infinite subset of A, and assume 
that condition (1) in the statement of Theorem \2.^ is verified. Then, for every sequence 
X = {Xa '■ a G A} of independent centered random variables, with unit variance and such 
that sup (J < oo, as N —>■ oo the law of the vector {Wn ; Qujif^^ X) : j = 1, m}, 

where Wn = '^Yld=i^ai, converges to the law of {Nq; Nj : j = l,...,m}, where Nq ~ 
^(0, 1), and {Ni, N^) ~ ^/^(O, V) denotes a centered Gaussian vector with covariance 
V, and independent of Nq. 

Theorem 12.21 and Proposition 12.31 imply that, in order to prove a CLT involving vectors 
of homogeneous sums based on some independent sequence X, it suffices to replace X with 
an i.i.d. Gaussian sequence G. In this way, one obtains a sequence of random vectors 
whose components belong to a fixed Wiener chaos. We now present two results, showing 
that proving CLTs for this type of random variables can be a relatively easy task: indeed, 
one can apply some drastic simplification of the method of moments. The first statement 
deals with multi-dimensional CLTs and shows that, in a Gaussian Wiener chaos setting, 
componentwise convergence to Gaussian always implies joint convergence. See also pLj for 
some connections with Stokes formula. 

Theorem 2.4 (Multidimensional CLTs on Wiener chaos, see |19L 123]) Let the fam- 
ily G = {Ga : a ^ A] be i.i.d. centered standard Gaussian and, for j = l,...,m, define 
the sequences Qa: (Z]^'', G), N ^ 1, as in Theorem \2.2 (in particular, the functions f^^ 
verify the same assumptions as in that theorem). Suppose that, for every i,j = 1, m, as 



where V is a mx m covariance matrix. Finally, assume that Wn, N ^ 1, is a sequence of 
o/r(0, 1) random variables with the representation 



where the weights wn{cl) are zero for all but a finite number of indices a, and J2aeA '^N^ciY = 
1 . Then, the following are equivalent: 

(1) The random vector {Wn ', Qkjifj^K G) : j = 1, ...,m} converges in law to {Nq ; Nj : 
j = 1, m}, where Nq ~ ^(0, 1), and {Ni, Nm) ~ ^yKn{0, V) denotes a centered 
Gaussian vector with covariance V, and independent of Nq. 



N 



oo 




(2.19) 
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(2) For every fixed j = l,...,m, the sequence Qfc^ (Z^'', G), N 1, converges in law to 
Z ~ -^(O, V^(j, j)), that is, to a centered Gaussian random variable with variance 
V{j,j). 

The previous statement implies that, in order to prove CLTs for vectors of homoge- 
neous sums, one can focus on the componentwise convergence of their (Gaussian) Wiener 
chaos counterpart. The forthcoming Theorem 12.61 shows that this type of one-dimensional 
convergence can be studied by focussing exclusively on fourth moments. To put this result 
into full use, we need some further definitions. 

Definition 2.5 Fix k ^ 2. Let / : — > M be a (not necessarily symmetric) function 
vanishing on diagonals and with finite support. For every r = 0, k, the contraction f-kj-f 
is the function on yl2d-2r gjygj^ j-^y 

f-krfiai,...,a2d-2r) (2.20) 

)f{ak-r+l, a2d-2r, Xl, ...,Xr). 

{xi,...,Xr)€A^- 

Observe that (even when / is symmetric) the contraction f-krf is not necessarily symmetric 
and not necessarily vanishes on diagonals. The canonical symmetrization of f-krf is written 

f^rf- 

Theorem 2.6 (The simphfied method of moments, see [22| ) Fix k ^ 2. Let G = 

{Ga : a G A} be an i.i.d. centered standard Gaussian family. Let {f^ '■ N ^ 1} be a 
sequence of functions such that f^ : ^ M. is symmetric and vanishes on diagonals. 
Suppose also that |supp(/iv)| oo, as N oo. Assume that 

E[QkifN,GY]^ >0, asN^oo. (2.21) 

Then, the following three conditions are equivalent, as N ^ oo. 

(1) The sequence QkifNi G), N ^ 1, converges in law to Z ^ ^(0,cr^). 

(2) E[QkifN,Gy]-*3a\ 

(3) For every r = 1, k-1, II/at^^ /Af||2A:-2r 0. 



Finally, we present a version of Theorem 12.21 with bounds, that will lead to the proof 
of Theorem ll.ll (ii) provided in Section [4741 

Theorem 2.7 (Universal bounds, see ll9j) Let X = {Xa : a E A} be a collection of 
independent centered random variables, with unit variance and such that (3 := sup^ < 
oo. Fix integers m 1, km > ■■■ > ki ^ 2. For every j = 1, ...,m, let f^^^ : A^^ — > M 6e 
a symmetric function vanishing on diagonals. Define Q^(X.) := Qkj{f^-'\'^) according to 
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^2.16\) . and assume that -^[(^-'(X)^] = 1 for all j = 1, . . . ,m. ^4/50, assume that K > is 
given such that ^^^^ maxi^j<jm Infa(/*^-'^) ^ K, where 



Inf,(/(^))= Yl f^'\a,a2,...,a,f = — 

|a2,...,afc jcA J 



a2,...,afe,eA 



Let if : ^ R 5e a thrice differentiable function such that \\(p"\\ao + ||(/?'"||oo < oo, 
with ilv^^^^lloo = max|„|=fc i sup^g]j„ \d"(p{z)\. Then, for Z = {Z\ . . . , Z"") ~ ^(0, /„) 
(standard Gaussian vector on W"^), we have 



\E[^{Q\X),...,Q"^iX))]-E[^iZ)]\^y"\\^[J2^n + '^ E 



+Ky'"\\oo\f3 + \- 



where Aij, 1 ^ i ^ j ^ m, is given by 

fc,-i 



j=i 



max max Infa 



^ - 1)' (t- 1 ) ( r - 1 ) V^^^ + ^.-2r)!(||/« ..... f^%r + \\f^^ /(^^Ih.) 



We finish this section by a useful result, which shows how the influence Inf^/ of / : 
^ M can be bounded by the norm of the contraction of / of order k — 1: 

Proposition 2.8 Let f : M. be a symmetric function vanishing on diagonals. Then 

(/c - l)!maxlnfa(/) := max /(a, as, • • • , a.)^ ^ ||/ .fc-i /Ih- 



a2,...,afceA 



Proof. We have 



II/**-! /Ill 



a,beA 



E f{a,a2,...,ak)fib,a2,...,ak) 
E /2(a,a2, . . . ,afe) 

.a2,.--,afe6^ 



^ max 

a£A 



E /^(a,a2, . . . ,afc) 

.a2,.--,afeeA 

^ 2 

(A;-l)!maxInf,(/)' 
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As a consequence of Theorem 12.71 and Proposition EIH we immediately get the following 
result. 

Corollary 2.9 Let X = {Xa : a & A} be a collection of independent centered random 
variables, with unit variance and such that (3 := sup^i?|Xap < oo. Fix integers m ^ 1, 
km > ••• > ki ^ 1. For every j = 1, ...,m, let {/^^ : N ^ 1} be a sequence of functions such 
that f^^ : A'^^ is symmetric and vanishes on diagonals. Define (^^^(X) := QkjifN^, 

according to 112. 16\) . and assume that E[Q]y(X)2] = lforaUj = l,...,m and N ^ 1. Let 
ip : ^ M be a thrice dijferentiable function such that \\'^"\\oc + ||v^'"||oo < oo. If for 
some a > 0, ||/^'* *A;j_r fN^\\2r = 0{N~°') for all j = 1, . . . ,m and r = 1, . . . ,kj — 1, then, 
by noting {Z^,...,Z"^) a centered Gaussian vector such that E[Z^Z^] = if i ^ j and 
E[{Z^y] = 1, we have 

\E[^{Q],{X), Q-(X))] - E[^{Z\ Z^)] I = 0(iV""/2). 

3 Gaussian fluctuations of non-diagonal trace compo- 
nents 

Our aim in this section is to prove the multidimensional CLT (11.141) . by using the uni- 
versality results presented in Section [2l To do this, we shall use an auxiliary collection 
G = {Gij : 2, j ^ 1} of i.i.d. copies of a ^(0, 1) random variable. 

As in Section [LSI for a given integer k ^ 2, we write D)^ to indicate the set of vectors 
i = {ii, . . . ,ik) € [A^]'^ such that all the elements {ia,ia+i), a = I, ■ ■ ■ ,k, are different in 
pairs (with the convention that ik+i = ii). We have the following preliminary result: 

Proposition 3.1 For any fixed integer k ^ 2, 

Ar-^/2 J2 ■ ■ ■ G^,^^ ^Zk^ ^(0, k) asN-^oo. 

Remark 3.2 When k = 1, the conclusion of the above proposition continues to be true, 
since in this case we obviously have 

N 
1=1 

Proof of Proposition \3.1\ The main idea is to use the results of Section [2l in the special 
case A = N^, that is, A is the collection of all pairs (i, j) such that i,j ^ 1. Observe that 

jY k/2 Gi-^i^ . . . Gi^i^ = Qkifk,N, G), 
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with fk^N '■ ([^]^)'^ ~^ ^ the symmetric function defined by 

fk,N = ry fl^l, (3-22) 



where we used the notation 

f\^}i{{0'iM),- ■ ■ ,{a-k,hk)) = N'^/"^ ^ l{v(i)=ai,v(i)+i=6i} • • • l{v(fc)=afe,v(fc)+i=fefc}, (3.23) 



(k) 



and &k denotes the set of all permutations of [k]. Hence, by virtue of Theorem 12.61 to 
prove Proposition 13.11 it is sufficient to accomplish the following two steps: (5*^61^ 1) prove 
that property (3) (with fk^N replacing /at) in the statement of Theorem 12.61 takes place, 
and {Step 2) show that relation (I2.2ip (with fk^N replacing fjq) is verified. 

Step 1. Let r G {1, . . . , A; — 1}. For cr, r G 6^, we compute 

fk}j /lyi((a;i, l/l), . . . , {X2k-2r, 2/2fe-2r)) (3.24) 
_ ]\J~^ \ ^ -1 -1 

^* / J -'-{«<t(1)=^1i«<t(1) + 1=?/i} • • • ^{'i'cj{k-r)=^k-r,ia{k-r) + l=yk-r} 

^'^{3T{l)=^k-r+l,iT{l) + l=yk-r + l} ' ' ' {j T(k-r)=^2k-2r , j T{k-r) + l=y2k-2r} 

^ ■'■{V(fc-r + l)=iT(fc-r + l)i«<T(fc-r + l) + l=iT(fc-r + l) + l} ' ' ' { V(fc) (fc) i V(fc) + 1 (fc) + l } ' 

We now want to assess the quantity ||/fc°/v*r/fc^Arll2fc-2r- To do this, we exploit the represen- 
tation (I3.24P in order to write such a squared norm as a sum over ([A^]'^)^: as a consequence, 

one deduces that \\fi%^r /fcJ|lL-2r ^ Wn'''^^ ^"^'^ where F^,"'"'"^ is the subset of {[Nff 
composed of those quadruplets (i,j,a, b) such that 

^o-(l) = Cto-(l), = Ctcr(l)+1, -I ia{k-r) = O-aik-r), V(A;-r)+l = 0'a{k-r)+l 

jV(l) = br{l), Jr(l)+1 = K{1)+1, ■ ■ • 5 jrik-r) = K{k~r), jr(fc-r)+l = ^r(fc-r)+l 
iaik-r+l) = jrik-r+l), V(A;-r+l)+l = jr(fc-r+l)+l , • • • , ^crik) = jrik), '*o-(fc)+l = jT(fc)+l 
'^(T(fe-r+l) = br{k-r+l)i (^cr{k-r+l)+l = &r(fe-r+l)+l) • • • j (^cr{k) = Kik), 0.a{k)+l = ^T(fc)+1- 

(3.25) 

It is immediate that, among the equalities in (13.251) . the 2k equalities appearing in the 
forthcoming display (I3.26P are pairwise disjoint (that is, an index appearing in one of the 
equalities does not enter into the others): 

V(l) = «o-(l), • • • , ia{k-r) = «cr(A:-r)) jV(l) = &r(l), • • • , jrik-r) = K{k-r) 

V(fc-r+l) = jr(fc-T-+l), • • • , V(fc) = jrik), «(T(fc-r+l) = K{k-r+l), ■ ■ ■ , 0,cr{k) = K(k)- 

(3.26) 

Hence, the cardinality of F^'"^'"^^ is less than A^^'^, from which we infer that H/fc^Ar^f /fc^ivll2fc-2r 
is bounded by 1. This is not sufficient for our purposes, since we need to show that 
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Il/fc°/v '^r /i'jvll2A:-2r tends to zeio as ^ cxD. To prove this, it is sufficient to extract 
from (13.25P one supplementary equality which is not already written in (13 .26^ . We shall 
prove that this equality exists by contradiction. Set L = {cr(s) : 1 ^ s ^ k — r} and 
R = {<7{s) + 1: 1 ^ s ^ k — r} (with the convention that A; + 1 = 1). Now assume that 
R = L. Then cr(l) + 1 E R also belongs to L, so that cr(l) + 2 G i?. By repeating this 
argument, we get that L = R = [k], which is a contradiction because r ^ 1. Hence, R^ L. 
In particular, the display (13.251) implies at least one relation involving two indices that are 
not already coupled in (13.261) . This yields that the cardinality of F^'^''^^ is at most A^^'^~^, 
and consequently that H/^'^^r /fc^Ar|l2fc-2r ^ N~'^. This fact implies immediately that the 
norms H/at /Ar||2fe-2r, r = 1, . . . , - 1, verify 

||/iV^r-/iv||2fe-2r = 0(iV-V2), ^g^y) 

and tend to zero as N —>■ oc. In other words, we have proved that condition (3) in the 
statement of Theorem 12.61 is met. 

Step 2. We have 



Var 



For fixed i, j G -Oj^'*, observe that the expectation E[Gi^i^ . . . Gi^i^Gj^j^ . . . Gjj.jJ can only 
be zero or one. Moreover, it is one if and only if, for all s G [/c], there is exactly one t G [k] 
such that {is,is+i) = {jt,jt+i)- In this case, we define a G as the bijection of [k] into 
itself which maps each s to the corresponding t and we have, for all s G [k], 

is = 3a(s) = 3a(s-l) + l- (3.28) 

To summarize, one has that Var (^"^/"^ X^ig^C') Gi^i^ . . . Gi^i^ equals 

N-^ J2 |{(iJ) e (D^PY : {is,is+i) = {j.is),Ja(s)+i) for all s G [k]}\. (3.29) 

U a E &k is such that a{s) = a{s — 1) + 1 for all s (it is easily seen that there are exactly k 
permutations verifying this property in &k), we get k different conditions by letting s run 
over [k] in (13.281) . so that 

{(i,j) G {D^^^Y : {is,is+i) = Ua{s), J a{s)+i) for all s G [A;]} ~ N\ as N oo. 

In contrast, if a G ©a: is not such that a{s) = a{s — 1) + 1 for all s, then by letting s run 
over [fc], one deduces from (|3.28l) at least fc + 1 different conditions, so that, in this case, 

{(i,j) G (Z^S^)2 : (z„«,+i) = {j.is),Ms)+i) for all s G [fc]} = o(iV'=), as iV ^ oo. 
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Taking into account these two properties together with the representation (13.291) . we deduce 
that the variance of 



ie-D 



(fc) 



tends to as — > oo. It follows that the required property fl2.2ip in Theorem 12.61 (with 
cr^ = k) is met. 

The proof of Proposition 13. II is concluded. ^ 

The multidimensional version of Proposition 13.11 reads as follows: 
Proposition 3.3 Fix m ^ 1, as well as integers km > ■ ■ ■ > ki ^ 2. Then, as N oo, 
( 

iV-'/'J^G,,, iV-^ ■ ■ ■ . . . (3.30) 



V 



1=1 



(fcl) 



A^-^ V G G • 



(km) 



Law 



J 



where Z = {Zk : k ^ 1} denotes a collection of independent centered Gaussian random 
variables such that, for every k ^ 1, E{Zl) = k. 

Proof. It is an application of Theorem 12.41 in the following special case: 

- WN{i,j) = if i = j ^ N and WN{i,j) = otherwise; 

- \^ is equal to the diagonal matrix such that V{a,b) = if a ^ b and V{a,a) = ka, 
for a = 1, m; 



- for j = 1, ...,m, /^^ = fkj^N, where we used the notation (I3.22p . 

Indeed, in view of Proposition [3Tl one has that condition (2) in the statement of Theorem 
12.41 is satisfied. Moreover, for fixed a 7^ 6 and since G consists of a collection of independent 
and centered (Gaussian) random variables, it is clear that, for all A^, 



E 



ieD 



so that condition (12.19p is met. The proof is concluded. 



□ 



By combining Proposition 13.31 and Proposition 12.31 we can finally deduce the following 
general result for non-diagonal trace components. 
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Corollary 3.4 For N 2, let be the N x N random matrix given by where 
the reference random variable X has mean zero, unit variance and finite absolute third 
moment. Fix m ^ \, as well as integers 2 ^ ki < . . . < km- Then, the CLT fil.l4\ ) 
takes place, with Z = {Zk : k ^ 1} denoting a sequence of independent centered Gaussian 
random variables such that, for every k ^ 1, E{Zl) = k. 

Remark 3.5 In order to prove Corollary 13.41 one only needs the existence of third mo- 
ments. Note that, as will become clear in the following SectionSl moments of higher orders 
are necessary for our proof of ( 11.150 . 



4 The remainder: combinatorial bounds on partitioned 



chains and proof of Theorem 11.1 



Fix an integer k ^ 2. From section 11.31 recall that Z)^^ denotes the subset of vectors 
i = (ii, ... ,4) G [A^]^ such that all the elements {ia,ia+i), « = I, ■ ■ ■ ,k, are different in 
pairs (with the convention that ik+i = ii). From the Introduction, recall that X is a 
centered random variable, having unit variance and with finite moments of all orders. Let 
also X = {Xij : i,j ^ 1} be a collection of i.i.d. copies of X. In the present section, our 
aim is to prove f ll.lSp . that is 



Proposition 4.1 For every k ^ 2, as N ^ oo 

( \ 

Var 



fe/2 ^ \Xi^i^ . . . Xi^i^ - E{Xi-^i^ . . . Xi^i^ 



0{N-^). (4.31) 



The proof of Proposition [4]T] is detailed in Section [441 and builds on several combinatorial 
estimates derived in Sections I4.2H4. 31 To ease the reading of the forthcoming material, we 
now provide an intuitive outline of this proof. 

Remark on notation. Given an integer ^ 2, we denote by V{k) the collection of all 
partitions of [k] = {l,...,k}. Recall that a partition n G V{k) is an object of the type 
71 = {Bi, ...,Br}, where the -B/s are disjoint and non-empty subsets of [k], called blocks, 
such that Uj=i^,,,^rBj = [k]. Given a,xE [k] and vr G 'P(fc), we write a ~ x whenever a and 
X are in the same block of tt. We also use the symbol 1 to indicate the one-block partition 
1 = {[k]} (this is standard notation from combinatorics - see e.g. [28]). In this section, for 
the sake of simplicity and because k is fixed, we write instead of D^^^ 



N ■ 



4.1 Sketch of the proof of Proposition [47T] 

Our starting point is the following elementary decomposition: 

neQ(k) 
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where Q{k) stands for the collection of all partitions of [k] containing at least one block 
of cardinality ^ 2, and A^^tc) is the collection of all vectors i G [A^]'^ such that the 
equality {ia,ia+i) = (ix^ix+i) holds if and only if a ~ x. Using this decomposition, one 
sees immediately that, in order to show (14.311) . it is sufficient to prove that, for each Gxed 
TT G Q{k), the quantity 

VarliV-^-/2 J2 [X,,,,...X,,,,-E(X,,,,...X,,,J] j (4.32) 

= N ^ [ E{^iii2 ■ ■ ■ ^ikh^Oih ■ ■ ■ ^jkji) ~ E{^hi2 ■ ■ ■ Xikh)E{^hh ■ ■ ■ ^jkji) ] 

(iJ)eAjv(7r)xAAr(7r) 

is 0{N~^), as — > CX3. Let G'Ar(vr) denote the subset of pairs (i,j) G v4iv(7r) x An{tt) such 
that the following non-vanishing condition is in order: 

E{Xi^i^ . . . Xi^i^Xj^j^ . . . Xj^j^) - E{Xi^i^ . . . Xi^i^)E{Xj-j^ . . . Xj^j^) ^ 0. (4.33) 

Hence 

VarliV-'^/^ [X,,,,...X,,,,-i5;(X,,,,...X,,,J] j (4.34) 

\ iGAjv(7r) J 

= iv~^ ^ [E{Xi^i^ . . . Xi^i^Xj^j^ . . . Xj^j^) - E{Xi^i^ . . . Xi^i^)E{Xj^j^ . . . Xj^jJ ] . 

(i,j)eGjv(7r) 

Due to the finite moment assumptions for X, and by appling the generalized Holder in- 
equality, it is clear that, for a generic pair (i, j), 

\E{Xi-^i2 . . . Xi^i^Xj^j^ . . . Xj^j^)-E{Xi^i^ . . . Xi^i^)E{Xj^j2 . . . Xj^jJ I ^ 2 £'(|Xp^) < oo. 

It follows that, in order to prove that the sum in (I4.34p is 0(X~^), it is enough to show 
that 

\GN{n)\ ^Q{k,n)N''~\ (4.35) 

for some constant 9(fc,7r) not depending on X. Our way of proving (14.351) is to show that, 
if (i, j) denotes a generic element of G'Ar(7r), then, necessarily, there exists at least k + 1 
equalities between the 2k indices ii, . . . ,ik, ji, . . . ,jk of (i, j). Note that by 'equality' we 
just mean the existence of two different integers a, 6 G [k] such that ia = ib or ja = jb, or 
the existence of two integers a,b E [k] such that ia = jb- Proving this fact implies that 
the 2k indices of a generic elements (i, j) of G'Ar(7r) have at most k — 1 degrees of freedom 
(see Point 7 of Section 4.2 for a precise definition), so that ( 14.351) holds immediately — the 
constant Q{k, vr) merely counting the number of ways in which the k + 1 equalities can be 
consistently distributed among the indices composing (i, j). In order to extract these k + 1 
equalities between the 2k indices of a generic element (i,j) of Gj\f{n), we will consider two 
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cases, according as the partition tt G Q{k) contains at least one singleton or not. 

Case A: No singletons in n. By definition of An^h), and due to the absence of singleton 
in TT, we already see that there are at least k/2 or {k + l)/2 (according to the evenness of 
k) equalities between the k indices of i (resp. j). Moreover, the non- vanishing condition 
(I4.33P implies that there is at least one further equality between one index of i and one 
index of j. So, we proved the existence oi k + 1 equalities between the 2k indices of (i, j), 
and the proof of (I4.35P in the Case A is done. 

Case B: At least one singleton in n. Let S denote the collection of the singleton(s) of vr. In 
order for (I4.33P to be true, observe that, for all s e S", we must have {js,js+i) = (^a,^a+i) 
for some a G [k]. In particular, this means that there exist \S\ equalities of the type js = ia 
for the indices composing (i, j). Also, by definition of the objects we are dealing with, for 
all t G [/c] \ S, we must have {it, it+i) = {ia, ia+i) for some a, different from t, in the same 
TT-block as t. Of course, the same must hold with i replaced by j. Hence, in order for f l4.35p 
to be true, it remains to produce one equality between indices that has not been already 
considered. We mentioned above that for all t E [k]\ S, there exists a, different from t and 
in the same block as t, such that jt = ja- Hence, to conclude it remains to show that we 
have jt = ja for at least one integer t belonging to [k] \ S and one integer a not belonging 
to the same block as t. Since, by assumption, tt contains at least one singleton and one 
block of cardinality ^ 2 (indeed, vr G Q{k)), without loss of generality (up to relabeling 
the indices according to a cyclic permutation of [k]), we can assume that S contains the 
singleton {k}. Consider now the singleton {s*} of S, where s* is defined as the greatest of 
the integers m such that {m} is adjacent from the right to a block, say Bu*, of cardinality 
^ 2. For a particular example of this situation, see the diagram in Fig. [H where each row 
represents the same partition of [7] having s* = 6 (see Point 3. in the subsequent Section 
14.21 for a formal construction of diagrams) . To finish the proof, once again we split it into 
two cases: 

Case Bl: The block Bu* contains two consecutive integers. This assumption implies that 
jx = jt = jt+i for all x,t E Bu*. Since {a} is adjacent from the right to Bu*, we have 
ja = jt for all t G Bu*, which is exactly what we wanted to show. 

Case B2: The block B^* does not contain two consecutive integers. Fig. [7] is an illus- 
trative example of such situation, where each row represents the same partition of [8], 
with s* = 7. As we see on this picture, we have necessarily jj = js, yielding the desired 
additional equality, which could not be extracted from the previous discussion. In Section 
14.31 it is shown that this line of reasoning can be extended to general situations. 

Remark 4.2 The sketch given above contains all the main ideas entering in the proof 
of Proposition 14. 1[ The reader not interested in technical combinatorial details, can then 
go directly to Section 14.41 where the proof of Theorem 11.11 is concluded. The subsequent 
Sections I4.2H4.3I fill the gaps of the above sketch, by providing exact definitions as well as 
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complete formal arguments leading to the estimate (14.311) . 



4.2 Definitions 

In the following list, we introduce some further definitions that are needed for the analysis 
developed in the rest of this section. 

1. Fix integers N,k ^ 2. A chain c of length 2k, built from [A^], is an object given 
by the juxtaposition of 2k pairs of integers of the type 

c = {iui2){i2,k)--{ik,ii){ji,j2){j2,j3)--{jk,ji), (4.36) 

where ia,jx € [N], for a,x = 1, k. The class of all chains of length 2k built from [A^] is 
denoted by C{2k, N). As a notational convention, we will use the letter i to write the first 
k pairs in the chain, and the letter j to write the remaining ones. For instance, an element 
of C(6, 5) (that is, a chain of length 6 built from the set {1, 2, 3, 4, 5}) is 

c=(l,5)(5,l)(l,l)(3,3)(3,3)(3,3), 

where = 1, ^2 = 5, is = 1, ji = j2 = js = 3. According to the graphical conventions 
given below (at Point 3 of the present list) we will sometimes say that (ii, i2)(i2, ^3)---(^fc, ^i) 
and (ji, J2)(j2, is) ■■■{jk,ji) are, respectively, the upper sub-chain and the lower sub-chain 
associated with the chain c in (14.361) . For instance, in the previous example the upper 
sub-chain is (1, 5)(5, 1)(1, 1), whereas the lower one is (3, 3)(3, 3)(3, 3). We shall say that 
(i;,z/+i) is the Ith pair in the upper sub-chain of c (and similarly for the elements of the 
lower sub-chain). We shall sometimes call ia the left index of the pair {ia,ia+i)- Also, we 
use the convention 4+i = H and jk+i = ji- Of course, a chain is completely determined 
by the left indices of its pairs. 

2. Let 71 E V{k) be a partition of [k]. Recall that, for a,b E [k], we write a ~ 6 to 
indicate that a and b belong to the same block of vr. We say that a chain c as in (I4.36P 
has partition n if, for every a,b E [k], the following double implications take place: (i) 
{ia, ia+i) = {ib, ib+i) if and only if a ~ 6, and (ii) {ja,ja+i) = {jb,jb+i) if and only if a ~ 6. 
In other words, a chain has partition vr if and only if the partitions of [k] induced by the 
identical pairs in its upper and lower sub-chain are both equal to vr, that is (with the 
notation of Section HHI), if and only if (zi, i^), (ji, j^) E An^ti). For instance, take 
k = 4 and vr = {{1,3}, {2,4}}. Then, the following chain built from [3] has partition vr: 

c=(l,2)(2,l)(l,2)(2,l)(3,l)(l,3)(3,l)(l,3). 

Note the 'only if part in the definition given above, implying that, if a chain has partition 
71 and if x and y are not in the same block of vr, then necessarily {ix,ix+i) 7^ ih^h+i) 
and {jx,jx+i) 7^ Uyyjy+i)- This yields in particular that a chain cannot have two different 
partitions. 
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3. Given /c ^ 2, we shall sometimes represent a generic chain with partition vr G V{k) 
by means of diagrams. These diagrams are mnemonic devices composed of an upper row 
and a lower row, of k dots each. These rows represent, respectively, the upper and lower 
sub-chain of a given chain, in such a way that the Ith dot (from left to right) in the upper 
(resp. lower) row corresponds the Ith pair in the upper (resp. lower) sub-chain. Each block 
B of the partition tt is represented by two closed curves: the first one is drawn around 
the dots of the upper row corresponding to the pairs {ia, ia+i) verifying a E B; the second 
one is drawn around the dots of the lower row corresponding to those {jx,jx+i) verifying 
X E B. The resulting diagram is the superposition of two identical combinations of dots 
and curves. Note that the shape of the diagram does not depend on A^. For instance, the 
diagram in Fig. [T] corresponds to the case /c = 7, and vr = {{1, 4, 5}, {2}, {3}, {6}, {7}}|^ 
whereas the diagram in Fig. [2] corresponds to A; = 6 and the one-block partition 1 = {[6]}. 




Figure 1: a chain with a five-block partition 




Figure 2: a chain with a one-block partition 

4. In general, given a chain c as in (I4.36P with partition tt = {Bi, Br} as at Point 2 
of the present list, we shall say the the block B^ of the upper sub-chain corresponds to 
the block By of the lower sub-chain, whenever {ia,ia+i) = {jx,jx+i) for every a E Bu and 
every x E By. Note that one given block By in the upper sub-chain cannot correspond to 
more than one block in the lower sub-chain. For vr = S^} € 'P(fc), we shall now 

define a class of chains C.„{2k,N) C C{2k,N), whose elements have partition vr and are 
characterized by two facts: the associated upper and lower sub-chains have at least one 

chain with partition n as in Fig. [T]is 

c = (1, 1)(1, 2)(2, 1)(1, 1)(1, 1)(1, 3)(3, 1)(1, 1)(1,4)(4, 1)(1, 1)(1, 1)(1, 5)(5, 1). 
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pair in common, and "no singletons are left on their own". Formally, the class C-„{2k,N) 
is defined as follows (recall that we use the letter i for the elements of the upper sub- 
chain, and the letter j for the elements of the lower sub-chain), (i) If ^ 2 for every 
t = 1, r, then CT,{2k, N) is the collection of all chains of partition tt verifying that there 
exists a, x e [k] such that the block Ba in the upper sub-chain corresponds to the block 
in the lower sub-chain, (ii) If tt contains at least one singleton, then C^(2/c, A?") is the 
collection of all chains of partition tt such that every singleton in the upper (resp. lower) 
sub-chain corresponds to a block of the lower (resp. upper) subchain, that is: for every 
{a} G TT, there exists -u = 1, ...,r such that {ia,ia+i) = for every / G Bu, and, for 

every {x} G tt, there exists v = l,...,r such that {jx,jx+i) = {js,js+i) for every s G By. 
For instance, if A; 3 and tt = {[3]}, then one element of C-„-{6, 5) is 

c=(5,5)(5,5)(5,5)(5,5)(5,5)(5,5). 

If A; = 6 and TT = {{1, 2, 3}, {4}, {5}, {6}}, then one element of C^(12, 5) is 

c = (1, 1)(1, 1)(1, 1)(1, 2)(2, 5)(5, 1)(2, 2)(2, 2)(2, 2)(2, 5)(5, 1)(1, 2). 

5. Fix k,N ^ 2, as well as a partition tt = {Bi, Br} G V{k). Given two subsets 
U,V C [r] such that \U\ = \V\, let R : U ^ V : u t-^ R{u) be a bijection from U 
onto V. We shall denote by C^{2k, N) the subset of CT,{2k, N) composed of those chains 
c G CT,{2k, N) such that the block Bu in the upper sub-chain corresponds to the block 
Br{u) in the lower sub-chain. When U = {u} and V = {v} are singletons, we shall simply 
write C^''"(2A;, N) to indicate the set of those c G CT^{2k, N) such that the block B^ in 
the upper sub-chain corresponds to the block By in the lower sub-chain. For instance, the 
chain 

ci = (1,1)(1,1)(1,2)(2,5)(5,1)(2,2)(2,2)(2,5)(5,1)(1,2) 

is an element of C^(10, 4), where tt = {B^, B2, ^3, B^} = {{1, 2}, {3}, {4}, {5}}, U ^V^ 
{2,3,4}, and R{2) = 4, R{3) = 2 and i?(4) = 3. The chain 

C2 = (3,3)(3,3)(3,3)(3,3) 

belongs to CH(4,3), where i = {Bi} = {[2]}. Note that the definition of C^(2A;,iV) does 
not give any information concerning the blocks of the upper and lower sub-chains that do 
not belong, respectively, to the domain and the image of R. In other words, for a chain 
c G C^{2k, N), one can have that the block By in the upper sub-chain corresponds to the 
block By in the lower sub-chain even liu and v . For instance, the chain 

c = (1, 1)(1, 1)(1, 2)(2, 5)(5, 1)(1, 1)(1, 1)(1, 2)(2, 5)(5, 1) 

is counted as an element of C^(10,4), where 

TT = {Si, S2, S3, B,} = {{1, 2}, {3}, {4}, {5}}, 
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U = V = {2,3,4}, and R{u) = u, for u = 2,3, 4. 



6. Fix k,N ^ 2, as well as a partition n = {Bi, Br} G Vi^k). Given a bijection 
R : U ^ \^ as at Point 5 above, we shall represent a generic element of the class C^{2k, N) 
by means of a diagram built as follows: first (i) draw the diagram associated with the class 
C^(2A;, A^), as explained at Point 3 of the present list, then (ii) for every pair of blocks B^ 
and B^ such that u & U , v & V and v = R{u) (note that i?„ is in the upper sub-chain, and 
B^ in the lower sub-chain), draw a segment linking a representative element of B^ with a 
representative element of B^. For instance, the class C^(10, A^), associated with the chain 
ci appearing at Point 5 above, is represented by the diagram appearing in Fig. [3l whereas 
the chain C2 is associated with the class C^^(4, 3), whose diagram is drawn in Fig. [H 



Figure 3: a chain with three pairs of corresponding singletons 




Figure 4: a chain with two corresponding blocks 

7. Fix k,N ^ 2 and let C C C{2k, N) be a generic subset of C{2k, N). Let g = 1, 2k 
be an integer. We say that C has at most q degrees of freedom (or, equivalently, that 
C has at most q free indices) if there exists two subsets D,E C [k] such that \D\ ^ 1 
and the following two properties are verified: (i) \D\ + \E\ ^ q, and (ii) for ever}0 xd = 
{xa '■ a E D} E [A^]l^l and every ue = {yb '■ b E E} E [N]^^^, there exists at most one 
chain c as in (I4.36P such that ia = Xa for every a E D and jb = Ub for every b E E. Note 
that our definition contemplates the possibility that E = (/), and in this case the role of 
= is immaterial. In other words, the class C has at most q degrees of freedom if every 
c G C is completely determined by those ia in the upper sub-chain such that a E D and 
those jb in the lower sub-chain such that b E E. For instance, it is easily seen the class 
C{2k,N) has (exactly) 2k degrees of freedom. Another example is the diagram in Fig. 
[HI which corresponds to the case A; = 6, vr = {{1, 2}, {3, 5}, {4, 6}} and u = v = 1. One 

''As indicated by our notation, we regard xd and ue as vectors, respectively in [A^]'^' and [A^]''^', by 
endowing D and E with the natural ordering induced by the ordering on [k]. 




23 



sees that, for every A^, specifying ii, ^4 and completely identifies a chain inside the class 
Cl'^{12, N), which has therefore three degrees of freedomjfl 




Figure 5: a class with three degrees of freedom 

The proof of the two (useful) results contained in the next statement is elementary and 
omitted. 

Lemma 4.3 Fix k,N ^2. 

(1) Let q = I, 2k. Assume that a generic class C C C{2k, N) has at most q degrees of 
freedom. Then, \C\ ^ N'^ . 

(2) Let i = {[k]} be the one-block partition of [k]. Then, the class Ci{2k,N) contains 
only "constant" chains of the type fi4-36\ ) such that {ii,i2) = {ia,ia+i) = {jx,jx+i), 
for every a = 2, k and every x = 1, k. It follows that |Cj(2fc, A^)| = A^. 

Lemma His] will be used in the subsequent section. 
4.3 Combinatorial upper bounds 

We keep the notation introduced in the previous section. The following statement, which 
is the key element for proving Proposition 14.11 contains the main combinatorial estimate 
of the paper. 

Proposition 4.4 Fix k,N ^ 2, and let n = {Bi, ...,Br} £ 'P{k) be a partition containing 
at least one block of cardinality ^ 2. Let the class CT^{2k,N) be defined as at Point 4. of 
the previous section. Then, there exists a finite constant 0(/c,7r) ^ 0, depending only on k 
and TT (and not on N ), such that 

\C^{2k, N) I ^ e(A:, tt) X A^'^-^ (4.37) 

Proof. We shall consider separately the two cases 

A. For every f = 1, r, \B^\ ^ 2. 

B. The partition vr contains at least one singleton. 

^Indeed, one has necessarily that ii ^ 12 — H — is, = ji = J2 = is = jb, H = *6 and ji — je- 
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Case A. When k = 2,3, the only partition meeting the needed requirements is 1. According 
to Lemma [1731 - (2). |Cj(2A;,A^)| = A^, so that the claim is proved, and we shall henceforth 
assume that /c ^ 4. Start by observing that r ^ k/2. Moreover, the class Cni2k,N) 
contains only chains such that at least one block in the upper sub-chain corresponds to a 
block in the lower sub-chain, which yields in turn that 

r 

C^{2k,N)= U C:^\2k,N), 

u,v=l 

where we adopted the notation introduced at Point 5. of Section 14.21 This implies the 
crude estimate 

r 

\C^{2k,N)\ ^ Yl \Cr{2k,N)\. (4.38) 

u,v=l 

According to Lemma [4731 - (1). it is now sufficient to prove that each class C'!^''"{2k,N) has 
at most 2r — 1 degrees of freedom: indeed, (14.38!) together with the fact that 2r — 1 ^ A; — 1 
would imply relation (14.371) . with 6(fc, tt) = ^ fc^/4. Fix u,v E {1, ...,r}. To prove that 
C^'^{2k, N) has at most 2r — 1 degrees of freedom, we shall build two sets D,E C [k] as 
follows. For every s = 1, ...,r, choose an element of the block Bs, and denote this element 
by Qs- Then, define 

D = {as.s = l,...,r}, E = D\{a^}, 

where '\' denotes the difference between sets. We now claim that, for every xd = {xa '■ 
a E D} E [N]^^^ and every Ue = {Vb '■ b E E} E [A^]'-^', there exists at most one chain 
c E C^''"{2k, N) as in (14.360 such that ia = Xa for every a E D and jb = Ub for every b E E. 
To prove this fact, suppose that such a chain c exists, and assume that there exists another 
chain 

c' = (z;,4)(4,z;,)...(z1,^;)(j;,jO(j2,j;)-(jLji) 

verifying this property and such that c' E C^^''" {2k , N) . The following hold: (a) for every 
s = 1, ...,r and every a E Bg, one has that i'^ = Xa^ = ia,, = ia, (b) for every s ^ v and 
every aE Bs, j'a = Va, = ja, = ja and (c) foTS = v and every a E B^, 

■I ■/ •/ • ■ ■ 

As a consequence, c' = c. Since + =2r — 1, this concludes the proof of Proposition 
[Qin the Case A. 

Case B . We shall denote by S the collection of the singleton (s) of vr, that is the subset of 
[k] composed of those indices a such that {a} E n. Note that 15*1 > by assumption. We 
also write P for the collection of the indices u E [r] such that |-B„| ^ 2. Note that P is 
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a subset of [r], whereas S C [k]. Note also that the set [r]\P is the collection of all those 
V G [r] such that B^j is a singleton. Clearly, 




By exploiting the cyclic nature of sub-chains, we can always assume, without loss of gener- 
ality, that S contains the singleton {k}. Since P is not empty, this entails that there exists 
at least one singleton of vr that is adjacent from the right to a block of cardinality at least 
two. Formally, this means that there exists s* E S and u* E P such that s* — 1 G B^*. We 
shall distinguish two cases 

Bl. The block B^* contains two consecutive integers. 

B2. The block B^* does not contain two consecutive integers. 
{Proof under Bl.) The situation of Bl is illustrated in Fig. [H where = 9, 

TT = {B,, Bj} = {{1}, {2}, {3, 6, 7}, {4}, {5}, {8}, {9}}, 
and one can take s* = 8, u* = 3, and the two consecutive integers in B^* are 6 and 7. 
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Figure 6: a singleton is adjacent to a 3-block with two consecutive elements 

Since each element of C-„{2k,N) is such that every singleton in a given sub-chain corre- 
sponds to a block in the opposite sub-chain, we have that 

a(2fc,iV)= [jC^i2k,N), (4.39) 

Ren 

where we adopted the same notation as at Point 5. of Section [421 and the union runs over 
the class TZ of all bijections R : U ^ V such that both U and V contain the set [?"]\-P, and 
every pair (m, R{u)) is such that at least one of the two blocks B^ and -Bij(„) is a singleton. 
This entails the estimate 

\C^{2k,N)\ ^ 5^|C,^(2A;,iV)|. (4.40) 

Ren 

To conclude the proof, we shall show that every class C^{2k, N) appearing in f l4.40p has 
at most k — 1 degrees of freedom: indeed, this fact together with Lemma [1731 - (1) yields the 
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desired conclusion (I4.37p . with the constant 0(A;,7r) = \1Z\ (note that the definition of TZ 
does not depend on N) . To prove that C^{2k, N) has at most k — 1 degrees of freedom, 
we define two sets D,E C [k] as follows. For every s = 1, ...,r, choose an element of the 
block Bg, and denote this element by a^. Then, define 

D = {as:s = l,...,r}, E = D\ {{au^} U {as : s E [r]\P}} . 

In other words, E is obtained by subtracting from D the singleton(s) and the representative 
element of the block Bu*, that is, of the block adjacent to {s*}. We now want to prove 
that, for every Xd = {xa '■ a E D} E [N]^^^ and every Ue = {vt '■ b E E} E [A^]'^', there is 
at most one chain c E C^{2k, N) as in (14.361) such that ia = Xa for every a E D and jb = i/b 
for every b E E. To show this, assume that such a chain c exists, and suppose that there 
exists another chain 

c = (i'l , ^2 ) («2 , ^3) • • • , h h ) ( J2 > is ) • • • U'k 

verifying this property and such that c' E C^{2k,N) and c' 7^ c. By construction of the 
sets D and E, all the indices composing the upper chain are completely determined by 
the choice of x^), whereas the choice of i/e determines the indices such that either x is 
a singleton or x E B^ for some block B^ of cardinality ^ 2 and such that v u*. This 
entails in turn that, necessarily since c' 7^ c, one has that j'^ 7^ j^: for every x E B^*. This 
is absurd. Indeed, since Bu* contains two consecutive integers, one has that j'^ = j'^^^ and 
jx = jx+1 for every x E Bu*; it follows that, since {s*} is adjacent from the right to B^* 
and therefore s* — 1 E Bu* , 

■/ ■/ ■/ ■ ■ ■ 

Jx Js* — 1 Js* Vs* Js* Js* — 1 Jxy 

which is indeed a contradiction. Since 

\D\ + \E\=r+\P\-l^ + 1^1 + -l = k-l, 

the proof is concluded. 

{Proof under B2.) Since Bu* does not contain two consecutive integers and \Bu*\ ^ 2, we 
deduce the existence of a block Bu E n, which is different from Bu* and {s*}, enjoying the 
following "interlacement property": there exists an integer a E [k] such that a+1 < s* — 1, 
a E Bu* and a + 1 E Bu. The block Bu can be either a singleton or a block with two or 
more elements. This situation is illustrated in Fig. [3, corresponding to the case k = S 
and TT = {fii, B,} = {{1, 2}, {3, 5}, {4, 6}, {7}, {8}}. Here, s* = 7, Bu* = B^ = {4, 6}, 
Bu = B2 = {3, 5} and a = 4. 

The crucial remark is now that, for a chain c as in (14.361) with partition vr, one has that 
is* = ia+i- Indeed, a and s* — 1 both belong to Bu*, and therefore {is*-i,is*) = {ia,ia+i)- 
Since a + 1 E B^, this fact yields in particular that, = is* for every x E Bu, that is, 
the left indices associated with Bu are completely determined by the choice of ig* . By the 
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Figure 7: a singleton is adjacent to a 2-block with no consecutive elements 

same argument, one shows that jg* = ja+i- The rest of the proof is similar to the case 
Bl. First, we observe that the representation (I4.39p . with 71 defined exactly as for Bl, 
continues to be true, from which we deduce the estimate (14 .40^ . It is now sufficient to show 
that each class C^{2k, N) has at most k — 1 degrees of freedom. To do this, one chooses a 
representative element from each block Bg G tt, noted a^, and then defines the sets 

D = {a, : s = 1, r, s ^ u}, E = D\{as : s e [r]\P} , 

that is, D is built by selecting one element from each block of tt, except for Bu, and E is 
obtained by subtracting from D all the remaining indices a such that {a} is a singleton of 
TT. One has that 

\D\ + \E\^k-l. (4.41) 

Indeed, \D\ = r — 1 = \P\ + \S\ — 1 ^ -^^^ + 15*1 — 1, and then one has to consider two cases: 
either (a) B^ is a singleton, from which it follows that \E\ = \D\ — {\S\ — 1) ^ ^"J"^^ , or (b) 
Bu is not a singleton, yielding \E\ = \D\ — \S\ ^ ^^-^ — 1- In these two cases, (14.411) is 
then in order. To conclude, it remains to show that, for every Xd = {xa ■ ci ^ D} E [A^]'^' 
and every -jje = {yt '■ b G E} G [-/V]'-^', there is at most one chain c G C^{2k,N) as in 
(I4.36P such that ia = Xa for every a E D and jb = Ub for every b E E. To see this, assume 
that such a chain c exists, and observe that, due to the above considerations, the choice of 
xn completely determines the upper sub-chain of c, as well as those indices j^, in the lower 
sub-chain such that {x} is a singleton of tt or (whenever B^ is not a singleton) such that 
X G Bu. Since the remaining left indices in the lower sub-chain of c are determined by the 
choice of He, the claim is proved. In view of ( 14.4ip . this shows that C^{2k, N) has at most 
k — 1 free indices. This concludes the proof of Proposition 14. 4[ 

As an illustration of the above arguments, one can consider the diagram in Fig. [HI that is 
constructed from the situation in Fig. [7] by selecting U = V = {2,3,4,5} and -R(2) = 4, 
-R(3) = 5, -R(4) = 2 and -R(5) = 3. In particular, it is easily seen that fixing Z4, ij and is 
completely identifies a chain c inside the class C^(16,iV), that has therefore three degrees 
of freedom. 

□ 
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Figure 8: a class with three free indices 



4.4 Proofs of Proposition 14.11 and Theorem 11.11 

Proof of Proposition We take up the notation introduced in Section I4.1[ In view 
of Proposition 14.41 in order to prove relation f l4.35p (and therefore Proposition I4.ip . it 
is sufficient to show that, for every tt G each pair (i,j) G Gn{t^) is such that 

the corresponding chain (^i, «2)---(45 j2)---(jfc5 ji) is an element of C.„{2k,N), from 

which one deduces |(j'^r(7r)| ^ |C,r(2A;, A^)| ^ Q{k,7i)N''~^. To show the desired prop- 
erty, it is enough to prove that, for every pair (i,j) G v4jv(7r) x Ajy^ir) such that the 
chain {ii,i2)...{ik,ii){ji,j2)---{jk,ji) is not in C^(2/c,iV), one has that (i,j) ^ G'Ar(7r). 
By definition of C.j,{2k,N), we have to examine two cases. Start by considering a par- 
tition 71 G Q{k) not containing any singleton: if (i,j) G Ajy^n) x Aj^^n) is such that 
{ii,i2)---{ik,ii){ji,j2)---{jk,ji) ^ C^{2k,N), then the random variables Xj^^^^^ indexed by 
the upper sub-chain are independent of those indexed by the lower sub-chain, and conse- 
quently 

Ei^X^^i^ . . . Xi^i^Xj^j^ . . . Xj^j-^) — Ei^Xi-^i^ . . . Xi^i-^^Ei^Xj^j^ . . . Xji^j-^), 

yielding (i, j) ^ Giy{iT). On the other hand, if vr G Q{k) contains a singleton and if (i, j) 
is such that {ii,i2)...{ik,ii){ji, j2)---{jk, ji) ^ C-„{2k,N), then there exists a = l,...,k such 
that Xj^j^^^ or Xj^j^^-^ is independent of all the other variables indexed by the elements of 
the chain. This gives 

E{Xi^i^ . . . Xi^i-^Xj^j^ . . . Xj^j-^) = E^Xij^i^ . . . Xi^i-^)E(^Xj^j^ . . . Xji_j^) = 0, 

thus proving the required property (i, j) ^ G'Ar(7r). The proof is finished. ^ 

Proof of Theorem \l.l\ -(i): By virtue of the representation p.l2p - p.l3p and of Proposition 
14.11 one sees that, for every 2 ^ ki < ... < km, the limit in distribution of the vector 



Tr(X^),Tr(X^O [Tr(X^^)],...,Tr(X^-) 



E [Tr(X^™)] ) 



coincides with the limit in distribution of 



\ 




29 



□ 



Proof of Theorem For the simplicity of exposition, we assume that ki ^ 2, the 

proof when ki = 1 being completely similar and easier. We have, using the notation D 
introduced in the beginning of Section [T73l and using (I1.13I) . 



(k) 

N 



E 



Tr(X^-)-E[Tr(X^^)] 

^ I / , = , • • • > 

Var(Tr(X^^)) 



Tr(X ^-)-E[Tr(X^-) ] 
Var(Tr(X^™)) 

A, _|_ "'Tl 



-E 



where, by writing Var(Tr(X^)) = Cj{N), 
( 



A 



N 



E 



1 V X 



X 



^k^^l1 ■ ■ ■ 1 



i2 ■ ■ ■ -^ik^h 



(km) 



and 



/J 



-E 



N, 



Bn 
E 



(fcm) 



/J 



Tr(X^^) - ^[Tr(X^^)] Tr(X^™) - ^[Tr(X^ 



Var(Tr(X^^)) 



Var(Tr(Xj^™)) 



By combining Corollary 12.91 with the computations made in the proof of Proposition 13.11 
we immediately get that An = 0{N^^^^). For Bn, we can write 



\Bn\ ^ KyWooJ^E 



^ ] (^«li2 • • • -^ikjh B\Xi^i^ . . . Xjj,^ jj]) 



^ i^llV^'llooE 



Var 



\ 



^ ] (^nj2 • • • -^ikjh B\Xi^i^ . . . Xjj,^ jj) . 



{k-i) 



I 



for some constant K not depending on X, so that Bn = 0{N ^/^) = 0{N ^/^) by Propo- 
sition [4]TJ 
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